A simulation study of confounding in generalized linear models for air pollution epidemiology.

نویسندگان

  • C Chen
  • D P Chock
  • S L Winkler
چکیده

Confounding between the model covariates and causal variables (which may or may not be included as model covariates) is a well-known problem in regression models used in air pollution epidemiology. This problem is usually acknowledged but hardly ever investigated, especially in the context of generalized linear models. Using synthetic data sets, the present study shows how model overfit, underfit, and misfit in the presence of correlated causal variables in a Poisson regression model affect the estimated coefficients of the covariates and their confidence levels. The study also shows how this effect changes with the ranges of the covariates and the sample size. There is qualitative agreement between these study results and the corresponding expressions in the large-sample limit for the ordinary linear models. Confounding of covariates in an overfitted model (with covariates encompassing more than just the causal variables) does not bias the estimated coefficients but reduces their significance. The effect of model underfit (with some causal variables excluded as covariates) or misfit (with covariates encompassing only noncausal variables), on the other hand, leads to not only erroneous estimated coefficients, but a misguided confidence, represented by large t-values, that the estimated coefficients are significant. The results of this study indicate that models which use only one or two air quality variables, such as particulate matter [less than and equal to] 10 microm and sulfur dioxide, are probably unreliable, and that models containing several correlated and toxic or potentially toxic air quality variables should also be investigated in order to minimize the situation of model underfit or misfit.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the use of generalized additive models in time-series studies of air pollution and health.

The widely used generalized additive models (GAM) method is a flexible and effective technique for conducting nonlinear regression analysis in time-series studies of the health effects of air pollution. When the data to which the GAM are being applied have two characteristics--1) the estimated regression coefficients are small and 2) there exist confounding factors that are modeled using at lea...

متن کامل

Model choice in time series studies of air pollution and mortality

Multicity time series studies of particulate matter and mortality and morbidity have provided evidence that daily variation in air pollution levels is associated with daily variation in mortality counts.These findings served as key epidemiological evidence for the recent review of the US national ambient air quality standards for particulate matter. As a result, methodological issues concerning...

متن کامل

Accuracy comparison of Elamn and Jordan artificial neural networks for air particular matter concentration (PM 10) prediction using MODIS satellite images, a case study of Ahvaz.

Due to the complexity of air pollution action, artificial intelligence models specifically, neural networks are utilized to simulate air pollution. So far, numerous artificial neural network models have been used to estimate the concentration of atmospheric PMs. These models have had different accuracies that scholars are constantly exceed their efficiency using numerous parameters. The current...

متن کامل

An Introduction to Epidemiologic and Statistical Methods Useful in Environmental Epidemiology

Many developments in the design and analysis of environmental epidemiology have been made in air pollution studies. In the analysis of the short-term effects of particulate matter on daily mortality, Poisson regression models with flexible smoothing methods have been developed for the analysis of time-series data. Another option for such studies is the use of case-crossover designs, and there h...

متن کامل

The severity of the relationship between daily air pollution and cardiovascular deaths in Ahvaz, Iran- using generalized additive models (GAMs) for seven years during March 2008 - March 2015

Abstract Background and objectives: Some epidemiological evidence has shown the relationship between environmental air pollution and adverse health effects. The aim of this study was to evaluate the effect of daily air pollution on daily cardiovascular mortality in Ahvaz city. Materials and Methods: In this ecological study, air pollution data was inquired from the Ahvaz Environmental Protectio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Environmental Health Perspectives

دوره 107  شماره 

صفحات  -

تاریخ انتشار 1999